Question Analysis Report

Generated: 2025-07-03T22:08:06.009037

Executive Summary

Dataset Size:
9,098 observations
Features:
478 total
Models Analyzed:
7 outcomes
Best R²:
0.142

Model Performance Summary

Outcome Intercept Adj. R² F-statistic F p-value AIC BIC RMSE N Significant Features High VIF Features Mean VIF Max VIF Sample Size
news_proportion_left_leaning 18.3409*** 0.1201 0.1155 26.28 0.0000 88062.8 88404.4 30.5117 21 0 1.57 3.75 9,098
news_proportion_right_leaning 2.6151*** 0.0581 0.0533 11.89 0.0000 68057.1 68398.6 10.1620 18 0 1.57 3.75 9,098
news_proportion_center_leaning 78.8911*** 0.1416 0.1371 31.76 0.0000 88679.1 89020.7 31.5629 22 0 1.57 3.75 9,098
news_proportion_unknown_leaning 0.1529 0.0137 0.0086 2.67 0.0000 58256.7 58598.3 5.9302 7 0 1.57 3.75 9,098
news_proportion_high_quality 70.6175*** 0.1177 0.1131 25.69 0.0000 90344.1 90685.6 34.5871 23 0 1.57 3.75 9,098
news_proportion_low_quality 5.6757*** 0.0459 0.0409 9.26 0.0000 76386.7 76728.2 16.0615 15 0 1.57 3.75 9,098
news_proportion_unknown_quality 23.7069*** 0.1296 0.1251 28.67 0.0000 89093.3 89434.9 32.2895 22 0 1.57 3.75 9,098

Correlation Matrix

Feature Importance

Regression Coefficients by Outcome

news_proportion_left_leaning (R² = 0.120, 49 features)

news_proportion_right_leaning (R² = 0.058, 49 features)

news_proportion_center_leaning (R² = 0.142, 49 features)

news_proportion_unknown_leaning (R² = 0.014, 49 features)

news_proportion_high_quality (R² = 0.118, 49 features)

news_proportion_low_quality (R² = 0.046, 49 features)

news_proportion_unknown_quality (R² = 0.130, 49 features)

Model Family Comparisons

proportion_left_leaning

proportion_right_leaning

proportion_high_quality

proportion_news

num_citations

Multicollinearity Diagnostics

Interpretation: Variance Inflation Factor (VIF) measures multicollinearity.

news_proportion_left_leaning (High VIF: 0, Mean VIF: 1.57)

news_proportion_right_leaning (High VIF: 0, Mean VIF: 1.57)

news_proportion_center_leaning (High VIF: 0, Mean VIF: 1.57)

news_proportion_unknown_leaning (High VIF: 0, Mean VIF: 1.57)

news_proportion_high_quality (High VIF: 0, Mean VIF: 1.57)

news_proportion_low_quality (High VIF: 0, Mean VIF: 1.57)

news_proportion_unknown_quality (High VIF: 0, Mean VIF: 1.57)

Summary Statistics

Variable Type Mean Std Min Max N Missing
num_citations Citation Outcome 5.7652 5.1669 0.0000 46.0000 32,400 0
proportion_high_quality Citation Outcome 8.9662 21.3873 0.0000 100.0000 32,400 0
proportion_left_leaning Citation Outcome 1.6659 7.4564 0.0000 100.0000 32,400 0
proportion_right_leaning Citation Outcome 0.0819 1.2404 0.0000 50.0000 32,400 0
news_proportion_high_quality Citation Outcome 21.8282 39.9889 0.0000 100.0000 32,400 0
news_proportion_left_leaning Citation Outcome 4.7865 18.8207 0.0000 100.0000 32,400 0
news_proportion_right_leaning Citation Outcome 0.3746 5.5664 0.0000 100.0000 32,400 0
proportion_news Citation Outcome 10.7818 23.2094 0.0000 100.0000 32,400 0
turn_number Question/Response Feature 1.7057 2.0636 1.0000 39.0000 32,400 0
total_turns Question/Response Feature 2.5335 3.5807 1.0000 50.0000 32,400 0
question_length_chars_log Question/Response Feature -0.0000 1.0000 -3.8234 2.6358 32,400 0
question_length_words_log Question/Response Feature 0.0000 1.0000 -2.2585 2.9189 32,400 0
response_length_log Question/Response Feature -0.0000 1.0000 -7.0885 3.1220 32,400 0
response_word_count_log Question/Response Feature -0.0000 1.0000 -5.6188 2.9660 32,400 0
model_family_google Model Family 7,563 observations 23.3% - - 32,400 0
model_family_openai Model Family 11,168 observations 34.5% - - 32,400 0
model_family_perplexity Model Family 13,669 observations 42.2% - - 32,400 0

Technical Details

Regression Method: OLS_statsmodels

PCA Precomputed: True

PCA Used: True

Total Features: 48